Comments on : Augmenting the bootstrap to analyze high dimensional genomic data

نویسندگان

  • Nicole Krämer
  • Svitlana Tyekucheva
  • Francesca Chiaromonte
چکیده

Our first comment is on the optimization of the model parameter τ controlling the amount of noise in the augmented bootstrap method. In a supervised prediction problem, τ can and should be optimized using, e.g., a cross-validation (CV) procedure, as suggested by the authors. If the prediction accuracy is itself evaluated by cross-validation or a related approach, this yields a nested cross-validation procedure involving an inner-loop (in which the parameter is tuned) and an outer-loop (in which the prediction rule with tuned parameter is evaluated), see Statnikov et al (2005) and Boulesteix (2007). Note that different cross-validation schemes can yield different results due to, e.g., the difference in the size of the considered training subsets. In leave-one-out cross-validation, the training subsets have size n−1, whereas they have size n/2 in 2-fold cross-validation. In the case considered here, it is conceivable that cross-validation schemes with many folds (i.e. with large

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comments on: Augmenting the bootstrap to analyze high dimensional genomic data Connections between the augmented bootstrap and the shrinkage covariance estimator

In their enlightening and stimulating paper Svitlana Tyekucheva and Francesca Chiaromonte propose an “augmented bootstrap” (AB) approach to estimate covariance structure in high-dimensional data. They show that the AB estimator performs well in a catalog of examples. Moreover, according to the authors no assumption of a sparsity rationale is made. This is in contrast to a competing and computat...

متن کامل

Comments on: Augmenting the bootstrap to analyze high dimensional genomic data

We congratulate the authors on their interesting article which addresses an important problem in the statistical analysis of high-dimensional data, namely how to estimate the inverse of the population covariance matrix. As the authors have explained, this estimation problem is very challenging with high-dimensional data, as the sample size is generally not large relative to the dimension of the...

متن کامل

Comments on: Control of the false discovery rate under dependence using the bootstrap and subsampling

In this enlightening and stimulating paper, Professors Romano, Shaikh, and Wolf construct two novel resampling-based multiple testing methods using the bootstrap and subsampling techniques and theoretically prove that these methods approximately control the FDR under weak regularity conditions. The theoretical results provide a satisfactory solution to an important and challenging problem in mu...

متن کامل

Comments on: High-dimensional simultaneous inference with the bootstrap

Weprovide comments on the article “High-dimensional simultaneous inference with the bootstrap” by Ruben Dezeure, Peter Buhlmann and Cun-Hui Zhang.

متن کامل

Robust high-dimensional semiparametric regression using optimized differencing method applied to the vitamin B2 production data

Background and purpose: By evolving science, knowledge, and technology, we deal with high-dimensional data in which the number of predictors may considerably exceed the sample size. The main problems with high-dimensional data are the estimation of the coefficients and interpretation. For high-dimension problems, classical methods are not reliable because of a large number of predictor variable...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007